Serveur d'exploration sur la recherche en informatique en Lorraine

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

The third `CHiME' Speech Separation and Recognition Challenge: Dataset, task and baselines

Identifieur interne : 000193 ( Main/Exploration ); précédent : 000192; suivant : 000194

The third `CHiME' Speech Separation and Recognition Challenge: Dataset, task and baselines

Auteurs : Jon Barker [Royaume-Uni] ; Ricard Marxer [Royaume-Uni] ; Emmanuel Vincent [France] ; Shinji Watanabe [États-Unis]

Source :

RBID : Hal:hal-01211376

English descriptors

Abstract

The CHiME challenge series aims to advance far field speech recognition technology by promoting research at the interface of signal processing and automatic speech recognition. This paper presents the design and outcomes of the 3rd CHiME Challenge, which targets the performance of automatic speech recognition in a real-world, commercially-motivated scenario: a person talking to a tablet device that has been fitted with a six-channel microphone array. The paper describes the data collection, the task definition and the base-line systems for data simulation, enhancement and recognition. The paper then presents an overview of the 26 systems that were submitted to the challenge focusing on the strategies that proved to be most successful relative to the MVDR array processing and DNN acoustic modeling reference system. Challenge findings related to the role of simulated data in system training and evaluation are discussed.

Url:


Affiliations:


Links toward previous steps (curation, corpus...)


Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">The third `CHiME' Speech Separation and Recognition Challenge: Dataset, task and baselines</title>
<author>
<name sortKey="Barker, Jon" sort="Barker, Jon" uniqKey="Barker J" first="Jon" last="Barker">Jon Barker</name>
<affiliation wicri:level="1">
<hal:affiliation type="institution" xml:id="struct-153591" status="VALID">
<orgName>The University of Sheffield [Sheffield]</orgName>
<desc>
<address>
<addrLine>Western Bank Sheffield S10 2TN</addrLine>
<country key="GB"></country>
</address>
<ref type="url">http://www.sheffield.ac.uk/</ref>
</desc>
</hal:affiliation>
<country>Royaume-Uni</country>
</affiliation>
</author>
<author>
<name sortKey="Marxer, Ricard" sort="Marxer, Ricard" uniqKey="Marxer R" first="Ricard" last="Marxer">Ricard Marxer</name>
<affiliation wicri:level="1">
<hal:affiliation type="institution" xml:id="struct-153591" status="VALID">
<orgName>The University of Sheffield [Sheffield]</orgName>
<desc>
<address>
<addrLine>Western Bank Sheffield S10 2TN</addrLine>
<country key="GB"></country>
</address>
<ref type="url">http://www.sheffield.ac.uk/</ref>
</desc>
</hal:affiliation>
<country>Royaume-Uni</country>
</affiliation>
</author>
<author>
<name sortKey="Vincent, Emmanuel" sort="Vincent, Emmanuel" uniqKey="Vincent E" first="Emmanuel" last="Vincent">Emmanuel Vincent</name>
<affiliation wicri:level="1">
<hal:affiliation type="researchteam" xml:id="struct-420403" status="VALID">
<idno type="RNSR">201421147E</idno>
<orgName>Speech Modeling for Facilitating Oral-Based Communication</orgName>
<orgName type="acronym">MULTISPEECH</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.inria.fr/equipes/multispeech</ref>
</desc>
<listRelation>
<relation active="#struct-129671" type="direct"></relation>
<relation active="#struct-300009" type="indirect"></relation>
<relation active="#struct-423086" type="direct"></relation>
<relation active="#struct-206040" type="indirect"></relation>
<relation active="#struct-413289" type="indirect"></relation>
<relation name="UMR7503" active="#struct-441569" type="indirect"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-129671" type="direct">
<org type="laboratory" xml:id="struct-129671" status="VALID">
<idno type="RNSR">198618246Y</idno>
<orgName>INRIA Nancy - Grand Est</orgName>
<desc>
<address>
<addrLine>615 rue du Jardin Botanique 54600 Villers-lès-Nancy</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.inria.fr/nancy</ref>
</desc>
<listRelation>
<relation active="#struct-300009" type="direct"></relation>
</listRelation>
</org>
</tutelle>
<tutelle active="#struct-300009" type="indirect">
<org type="institution" xml:id="struct-300009" status="VALID">
<orgName>Institut National de Recherche en Informatique et en Automatique</orgName>
<orgName type="acronym">Inria</orgName>
<desc>
<address>
<addrLine>Domaine de VoluceauRocquencourt - BP 10578153 Le Chesnay Cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.inria.fr/en/</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-423086" type="direct">
<org type="department" xml:id="struct-423086" status="VALID">
<orgName>Department of Natural Language Processing & Knowledge Discovery</orgName>
<orgName type="acronym">LORIA - NLPKD</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.loria.fr/la-recherche-en/departements/Knowledge-and-Language-Management</ref>
</desc>
<listRelation>
<relation active="#struct-206040" type="direct"></relation>
<relation active="#struct-300009" type="indirect"></relation>
<relation active="#struct-413289" type="indirect"></relation>
<relation name="UMR7503" active="#struct-441569" type="indirect"></relation>
</listRelation>
</org>
</tutelle>
<tutelle active="#struct-206040" type="indirect">
<org type="laboratory" xml:id="struct-206040" status="VALID">
<idno type="IdRef">067077927</idno>
<idno type="RNSR">198912571S</idno>
<idno type="IdUnivLorraine">[UL]RSI--</idno>
<orgName>Laboratoire Lorrain de Recherche en Informatique et ses Applications</orgName>
<orgName type="acronym">LORIA</orgName>
<date type="start">2012-01-01</date>
<desc>
<address>
<addrLine>Campus Scientifique BP 239 54506 Vandoeuvre-lès-Nancy Cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.loria.fr</ref>
</desc>
<listRelation>
<relation active="#struct-300009" type="direct"></relation>
<relation active="#struct-413289" type="direct"></relation>
<relation name="UMR7503" active="#struct-441569" type="direct"></relation>
</listRelation>
</org>
</tutelle>
<tutelle active="#struct-413289" type="indirect">
<org type="institution" xml:id="struct-413289" status="VALID">
<idno type="IdRef">157040569</idno>
<idno type="IdUnivLorraine">[UL]100--</idno>
<orgName>Université de Lorraine</orgName>
<orgName type="acronym">UL</orgName>
<date type="start">2012-01-01</date>
<desc>
<address>
<addrLine>34 cours Léopold - CS 25233 - 54052 Nancy cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-lorraine.fr/</ref>
</desc>
</org>
</tutelle>
<tutelle name="UMR7503" active="#struct-441569" type="indirect">
<org type="institution" xml:id="struct-441569" status="VALID">
<idno type="ISNI">0000000122597504</idno>
<idno type="IdRef">02636817X</idno>
<orgName>Centre National de la Recherche Scientifique</orgName>
<orgName type="acronym">CNRS</orgName>
<date type="start">1939-10-19</date>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.cnrs.fr/</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
<placeName>
<settlement type="city">Nancy</settlement>
<settlement type="city">Metz</settlement>
<region type="region" nuts="2">Grand Est</region>
<region type="old region" nuts="2">Lorraine (région)</region>
</placeName>
<orgName type="university">Université de Lorraine</orgName>
</affiliation>
</author>
<author>
<name sortKey="Watanabe, Shinji" sort="Watanabe, Shinji" uniqKey="Watanabe S" first="Shinji" last="Watanabe">Shinji Watanabe</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-89592" status="VALID">
<orgName>Mitsubishi Electric Research Laboratories</orgName>
<orgName type="acronym">MERL</orgName>
<desc>
<address>
<addrLine>Mitsubishi Electric Research Laboratories Cambridge MA 02139</addrLine>
<country key="US"></country>
</address>
<ref type="url">http://www.merl.com/</ref>
</desc>
<listRelation>
<relation active="#struct-322337" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-322337" type="direct">
<org type="institution" xml:id="struct-322337" status="INCOMING">
<orgName>Mitsubishi Electric Research Laboratories</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>États-Unis</country>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">HAL</idno>
<idno type="RBID">Hal:hal-01211376</idno>
<idno type="halId">hal-01211376</idno>
<idno type="halUri">https://hal.inria.fr/hal-01211376</idno>
<idno type="url">https://hal.inria.fr/hal-01211376</idno>
<date when="2015-12-13">2015-12-13</date>
<idno type="wicri:Area/Hal/Corpus">004C97</idno>
<idno type="wicri:Area/Hal/Curation">004C97</idno>
<idno type="wicri:Area/Hal/Checkpoint">000164</idno>
<idno type="wicri:explorRef" wicri:stream="Hal" wicri:step="Checkpoint">000164</idno>
<idno type="wicri:Area/Main/Merge">000193</idno>
<idno type="wicri:Area/Main/Curation">000193</idno>
<idno type="wicri:Area/Main/Exploration">000193</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en">The third `CHiME' Speech Separation and Recognition Challenge: Dataset, task and baselines</title>
<author>
<name sortKey="Barker, Jon" sort="Barker, Jon" uniqKey="Barker J" first="Jon" last="Barker">Jon Barker</name>
<affiliation wicri:level="1">
<hal:affiliation type="institution" xml:id="struct-153591" status="VALID">
<orgName>The University of Sheffield [Sheffield]</orgName>
<desc>
<address>
<addrLine>Western Bank Sheffield S10 2TN</addrLine>
<country key="GB"></country>
</address>
<ref type="url">http://www.sheffield.ac.uk/</ref>
</desc>
</hal:affiliation>
<country>Royaume-Uni</country>
</affiliation>
</author>
<author>
<name sortKey="Marxer, Ricard" sort="Marxer, Ricard" uniqKey="Marxer R" first="Ricard" last="Marxer">Ricard Marxer</name>
<affiliation wicri:level="1">
<hal:affiliation type="institution" xml:id="struct-153591" status="VALID">
<orgName>The University of Sheffield [Sheffield]</orgName>
<desc>
<address>
<addrLine>Western Bank Sheffield S10 2TN</addrLine>
<country key="GB"></country>
</address>
<ref type="url">http://www.sheffield.ac.uk/</ref>
</desc>
</hal:affiliation>
<country>Royaume-Uni</country>
</affiliation>
</author>
<author>
<name sortKey="Vincent, Emmanuel" sort="Vincent, Emmanuel" uniqKey="Vincent E" first="Emmanuel" last="Vincent">Emmanuel Vincent</name>
<affiliation wicri:level="1">
<hal:affiliation type="researchteam" xml:id="struct-420403" status="VALID">
<idno type="RNSR">201421147E</idno>
<orgName>Speech Modeling for Facilitating Oral-Based Communication</orgName>
<orgName type="acronym">MULTISPEECH</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.inria.fr/equipes/multispeech</ref>
</desc>
<listRelation>
<relation active="#struct-129671" type="direct"></relation>
<relation active="#struct-300009" type="indirect"></relation>
<relation active="#struct-423086" type="direct"></relation>
<relation active="#struct-206040" type="indirect"></relation>
<relation active="#struct-413289" type="indirect"></relation>
<relation name="UMR7503" active="#struct-441569" type="indirect"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-129671" type="direct">
<org type="laboratory" xml:id="struct-129671" status="VALID">
<idno type="RNSR">198618246Y</idno>
<orgName>INRIA Nancy - Grand Est</orgName>
<desc>
<address>
<addrLine>615 rue du Jardin Botanique 54600 Villers-lès-Nancy</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.inria.fr/nancy</ref>
</desc>
<listRelation>
<relation active="#struct-300009" type="direct"></relation>
</listRelation>
</org>
</tutelle>
<tutelle active="#struct-300009" type="indirect">
<org type="institution" xml:id="struct-300009" status="VALID">
<orgName>Institut National de Recherche en Informatique et en Automatique</orgName>
<orgName type="acronym">Inria</orgName>
<desc>
<address>
<addrLine>Domaine de VoluceauRocquencourt - BP 10578153 Le Chesnay Cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.inria.fr/en/</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-423086" type="direct">
<org type="department" xml:id="struct-423086" status="VALID">
<orgName>Department of Natural Language Processing & Knowledge Discovery</orgName>
<orgName type="acronym">LORIA - NLPKD</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.loria.fr/la-recherche-en/departements/Knowledge-and-Language-Management</ref>
</desc>
<listRelation>
<relation active="#struct-206040" type="direct"></relation>
<relation active="#struct-300009" type="indirect"></relation>
<relation active="#struct-413289" type="indirect"></relation>
<relation name="UMR7503" active="#struct-441569" type="indirect"></relation>
</listRelation>
</org>
</tutelle>
<tutelle active="#struct-206040" type="indirect">
<org type="laboratory" xml:id="struct-206040" status="VALID">
<idno type="IdRef">067077927</idno>
<idno type="RNSR">198912571S</idno>
<idno type="IdUnivLorraine">[UL]RSI--</idno>
<orgName>Laboratoire Lorrain de Recherche en Informatique et ses Applications</orgName>
<orgName type="acronym">LORIA</orgName>
<date type="start">2012-01-01</date>
<desc>
<address>
<addrLine>Campus Scientifique BP 239 54506 Vandoeuvre-lès-Nancy Cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.loria.fr</ref>
</desc>
<listRelation>
<relation active="#struct-300009" type="direct"></relation>
<relation active="#struct-413289" type="direct"></relation>
<relation name="UMR7503" active="#struct-441569" type="direct"></relation>
</listRelation>
</org>
</tutelle>
<tutelle active="#struct-413289" type="indirect">
<org type="institution" xml:id="struct-413289" status="VALID">
<idno type="IdRef">157040569</idno>
<idno type="IdUnivLorraine">[UL]100--</idno>
<orgName>Université de Lorraine</orgName>
<orgName type="acronym">UL</orgName>
<date type="start">2012-01-01</date>
<desc>
<address>
<addrLine>34 cours Léopold - CS 25233 - 54052 Nancy cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-lorraine.fr/</ref>
</desc>
</org>
</tutelle>
<tutelle name="UMR7503" active="#struct-441569" type="indirect">
<org type="institution" xml:id="struct-441569" status="VALID">
<idno type="ISNI">0000000122597504</idno>
<idno type="IdRef">02636817X</idno>
<orgName>Centre National de la Recherche Scientifique</orgName>
<orgName type="acronym">CNRS</orgName>
<date type="start">1939-10-19</date>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.cnrs.fr/</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
<placeName>
<settlement type="city">Nancy</settlement>
<settlement type="city">Metz</settlement>
<region type="region" nuts="2">Grand Est</region>
<region type="old region" nuts="2">Lorraine (région)</region>
</placeName>
<orgName type="university">Université de Lorraine</orgName>
</affiliation>
</author>
<author>
<name sortKey="Watanabe, Shinji" sort="Watanabe, Shinji" uniqKey="Watanabe S" first="Shinji" last="Watanabe">Shinji Watanabe</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-89592" status="VALID">
<orgName>Mitsubishi Electric Research Laboratories</orgName>
<orgName type="acronym">MERL</orgName>
<desc>
<address>
<addrLine>Mitsubishi Electric Research Laboratories Cambridge MA 02139</addrLine>
<country key="US"></country>
</address>
<ref type="url">http://www.merl.com/</ref>
</desc>
<listRelation>
<relation active="#struct-322337" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-322337" type="direct">
<org type="institution" xml:id="struct-322337" status="INCOMING">
<orgName>Mitsubishi Electric Research Laboratories</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>États-Unis</country>
</affiliation>
</author>
</analytic>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="mix" xml:lang="en">
<term>`CHiME' challenge</term>
<term>microphone array</term>
<term>noise-robust ASR</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">The CHiME challenge series aims to advance far field speech recognition technology by promoting research at the interface of signal processing and automatic speech recognition. This paper presents the design and outcomes of the 3rd CHiME Challenge, which targets the performance of automatic speech recognition in a real-world, commercially-motivated scenario: a person talking to a tablet device that has been fitted with a six-channel microphone array. The paper describes the data collection, the task definition and the base-line systems for data simulation, enhancement and recognition. The paper then presents an overview of the 26 systems that were submitted to the challenge focusing on the strategies that proved to be most successful relative to the MVDR array processing and DNN acoustic modeling reference system. Challenge findings related to the role of simulated data in system training and evaluation are discussed.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>France</li>
<li>Royaume-Uni</li>
<li>États-Unis</li>
</country>
<region>
<li>Grand Est</li>
<li>Lorraine (région)</li>
</region>
<settlement>
<li>Metz</li>
<li>Nancy</li>
</settlement>
<orgName>
<li>Université de Lorraine</li>
</orgName>
</list>
<tree>
<country name="Royaume-Uni">
<noRegion>
<name sortKey="Barker, Jon" sort="Barker, Jon" uniqKey="Barker J" first="Jon" last="Barker">Jon Barker</name>
</noRegion>
<name sortKey="Marxer, Ricard" sort="Marxer, Ricard" uniqKey="Marxer R" first="Ricard" last="Marxer">Ricard Marxer</name>
</country>
<country name="France">
<region name="Grand Est">
<name sortKey="Vincent, Emmanuel" sort="Vincent, Emmanuel" uniqKey="Vincent E" first="Emmanuel" last="Vincent">Emmanuel Vincent</name>
</region>
</country>
<country name="États-Unis">
<noRegion>
<name sortKey="Watanabe, Shinji" sort="Watanabe, Shinji" uniqKey="Watanabe S" first="Shinji" last="Watanabe">Shinji Watanabe</name>
</noRegion>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Lorraine/explor/InforLorV4/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000193 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 000193 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Wicri/Lorraine
   |area=    InforLorV4
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     Hal:hal-01211376
   |texte=   The third `CHiME' Speech Separation and Recognition Challenge: Dataset, task and baselines
}}

Wicri

This area was generated with Dilib version V0.6.33.
Data generation: Mon Jun 10 21:56:28 2019. Site generation: Fri Feb 25 15:29:27 2022